Abstract. Considering the metric space of extended real-valued lower semicontinuous functions
under  the  epi-distance,  the  paper  gives  an  upper  bound  on  the  covering  numbers  of  bounded  subsets 
of  such  functions.  No  assumptions  about  continuity,  smoothness,  variation,  and  even  finiteness  of  the 
functions  are  needed.  The  bound  is  shown  to  be  nearly  sharp  through  the  construction  of  a  set  of 
functions  with  covering  numbers  deviating  from  the  upper  bound  only  by  a  logarithmic  factor.  The 
analogy  between  lower  and  upper  semicontinuous  functions  implies  that  identical  covering  numbers  hold 
for  bounded  sets  of  the  latter  class  of  functions  as  well,  but  now  under  the  hypo-distance  metric. 
1  Introduction

Covering  numbers  of  classes  of  functions  play  central  roles  in  parts  of  information  theory,  statistics,  and 
applications  such  as  machine  learning;  see  for  example  [26,  16] .  A  large  variety  of  results  are  available. 
The pioneering works [17, 11] deal with continuous and smooth functions; see [19] for a recent discussion.
Functions  of  bounded  variation  are  considered  in  [7]  and  analytic  functions  in  [13].  An  upper  estimate 
for  the  covering  numbers  of  the  unit  ball  of  Gaussian  reproducing  kernel  Hilbert  spaces  is  given  in  [29], 
with  further  refinements  and  applications  in  [27,  18].  Covering  numbers  of  sets  of  convex  functions  are 
established  in  [14,  12],  with  significant  improvements  in  [15]. 

In this paper, we provide upper bounds on the covering numbers of bounded subsets of extended
real-valued lower semicontinuous (lsc) functions on $\mathbb{R}^d$ under the epi-distance metric. We permit any
$d = 1, 2, \dots$ and establish an upper bound on the $\varepsilon$-metric entropy number, which is the logarithm of
the $\varepsilon$-covering number, that is of order $O(\varepsilon^{-d}(\log \varepsilon^{-1})^{d+1})$. This upper bound is nearly sharp as we
construct a bounded set of lsc functions that has $\varepsilon$-metric entropy number at least $c\,\varepsilon^{-d}\log \varepsilon^{-1}$ for some $c > 0$.

It  is  well-known  that  bounded  subsets  of  lsc  functions  are  totally  bounded  under  the  epi-distance 
metric  [20,  Theorem  7.58]  and  consequently  the  covering  numbers  of  such  sets  are  finite.  Here,  we 
establish for the first time a quantification of these covering numbers. The class of lsc functions is quite
expansive as it includes functions defined on all of $\mathbb{R}^d$ that might even take on the values $\pm\infty$. This class
is  of  interest  in  various  function  identification  problems  and  their  applications  in  statistics  and  operations 
research  [25,  23,  24],  and  is  fundamental  to  constrained  minimization  problems,  which  abstractly  can 
be represented by lsc functions [20]. Since the negative of an lsc function is an extended real-valued
upper semicontinuous (usc) function, the results developed here carry over directly to bounded subsets
of the usc functions, now under the hypo-distance metric. Further applications arise in probability
theory because there the hypo-distance metrizes weak convergence of distribution functions on $\mathbb{R}^d$,
which obviously are usc [22]. Thus, as an example, the covering numbers given in this paper provide
directly covering numbers for bounded sets of distribution functions on $\mathbb{R}^d$.

Technically, we rely on set-convergence of epi-graphs, coined epi-convergence by R. Wets in [28],
which is quantified by a modified Pompeiu-Hausdorff distance between subsets of $\mathbb{R}^{d+1}$. This perspective
was placed on a firm footing in [4, 2, 5, 8]; see also [6, 9, 10] for work on the convex case. The
relevant  results  are  available  in  [20,  Chapter  7],  which  together  with  recent  developments  in  [24,  21], 
provide  the  foundations  for  the  present  derivations. 

After  the  review  of  background  material  in  Section  2,  the  main  theorems  are  stated  in  Section  3. 
Proofs  and  supporting  results  are  given  in  Section  4. 

2  Background 

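We let $\text{lsc-fcns}(\mathbb{R}^d) := \{f : \mathbb{R}^d \to \overline{\mathbb{R}} : f \text{ lsc and } f \not\equiv \infty\}$, where $\overline{\mathbb{R}} := \mathbb{R} \cup \{-\infty, \infty\}$. Thus, every
$f \in \text{lsc-fcns}(\mathbb{R}^d)$ has a nonempty closed epi-graph $\operatorname{epi} f := \{(x, x_0) \in \mathbb{R}^d \times \mathbb{R} : f(x) \le x_0\}$. We adopt
the sup-norm on $\mathbb{R}^d$, which leads to slight simplifications below, but other choices would only influence
the constants in the main results. Let $\mathbb{B}(x, \rho) := \{y \in \mathbb{R}^d : \|x - y\|_\infty \le \rho\}$, $\rho\mathbb{B} := \mathbb{B}(0, \rho)$, and
$\rho\mathbb{S} := \rho\mathbb{B} \times [-\rho, \rho] \subset \mathbb{R}^d \times \mathbb{R}$. The epi-distance $dl$ is defined for any $f, g \in \text{lsc-fcns}(\mathbb{R}^d)$ as

$$dl(f, g) := \int_0^\infty dl_\rho(f, g)\, e^{-\rho}\, d\rho,$$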

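where the $\rho$-epi-distance, $\rho \ge 0$, is given by

$$dl_\rho(f, g) := \sup\{\, |\operatorname{dist}(\bar{x}, \operatorname{epi} f) - \operatorname{dist}(\bar{x}, \operatorname{epi} g)| : \bar{x} \in \rho\mathbb{S} \,\},$$

with dist giving the usual point-to-set distance, i.e., for $\bar{x} = (x, x_0) \in \mathbb{R}^d \times \mathbb{R}$,

$$\operatorname{dist}(\bar{x}, C) = \inf\{\, \max\{\|x - y\|_\infty, |x_0 - y_0|\} : (y, y_0) \in C \,\} \quad \text{if } C \subset \mathbb{R}^d \times \mathbb{R} \text{ is nonempty}$$

and $\operatorname{dist}(\bar{x}, \emptyset) = \infty$. It is clear that $dl_\rho(f, g)$ is closely related to the Pompeiu-Hausdorff distance between
$\operatorname{epi} f$ and $\operatorname{epi} g$, and in fact equivalent as $\rho$ tends to infinity. Roughly speaking, the epi-distance between
$f$ and $g$ is the weighted average of truncated versions of the Pompeiu-Hausdorff distance between their
epi-graphs.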
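To make the point-to-set distance concrete, the following is a minimal Python sketch for the special case of a function that is finite at finitely many points only and infinite elsewhere, so that its epi-graph is a union of vertical rays. The names `dist_to_epi` and `rho_epi_distance_lower` and the dictionary representation are illustrative assumptions, not notation from the paper. The second function evaluates the supremum over a finite grid of $\rho\mathbb{S}$ only, so it returns a lower estimate of the $\rho$-epi-distance rather than its exact value.

```python
import itertools


def dist_to_epi(point, level, support):
    """Sup-norm distance from (point, level) to epi f.

    `support` maps tuples in R^d to real values; f is +infinity elsewhere,
    so epi f is the union of the vertical rays {x_j} x [v_j, infinity).
    """
    if not support:
        return float("inf")  # convention: dist(., empty set) = infinity
    best = float("inf")
    for xj, vj in support.items():
        horizontal = max(abs(a - b) for a, b in zip(point, xj))
        vertical = max(vj - level, 0.0)  # zero once the level lies on the ray
        best = min(best, max(horizontal, vertical))
    return best


def rho_epi_distance_lower(f, g, rho, d, steps=20):
    """Lower estimate of the rho-epi-distance: sup over a finite grid of rho*S."""
    ticks = [-rho + 2.0 * rho * i / steps for i in range(steps + 1)]
    worst = 0.0
    for coords in itertools.product(ticks, repeat=d + 1):
        point, level = coords[:d], coords[d]
        gap = abs(dist_to_epi(point, level, f) - dist_to_epi(point, level, g))
        worst = max(worst, gap)
    return worst
```

For instance, with `f = {(0.0,): 0.0}` and `g = {(0.0,): 0.5}` the two epi-graphs are vertical rays over the origin starting at heights 0 and 0.5, and the grid estimate with `rho = 1.0` picks up the vertical offset between them.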

It is well-known that $(\text{lsc-fcns}(\mathbb{R}^d), dl)$ is a complete separable proper metric space [20, Theorem
7.58] (see also [24]). We recall that a metric space is proper if every closed ball in that space is compact.
The epi-distance induces the epi-topology on $\text{lsc-fcns}(\mathbb{R}^d)$, also called the Attouch-Wets topology. For
$f^\nu, f \in \text{lsc-fcns}(\mathbb{R}^d)$, $\nu \in \mathbb{N} := \{1, 2, \dots\}$,

$$dl(f^\nu, f) \to 0 \text{ if and only if } f^\nu \text{ epi-converges to } f.$$

Epi-convergence neither implies nor is implied by pointwise convergence. Uniform convergence ensures
epi-convergence, but fails to handle extended real-valued functions satisfactorily, which is a necessity in
constrained optimization problems. Epi-convergence ensures convergence of solutions of minimization
problems (see for example [20, Chapter 7] and [3, 1, 21]). It is therefore of particular importance in the
area of optimization with numerous applications in machine learning, statistics, and control, but also of
significance in the study of partial differential equations where the closely related notion of $\Gamma$-convergence
appears.

Following the usual definition of covering numbers, we let for any $F \subset \text{lsc-fcns}(\mathbb{R}^d)$ and $\varepsilon > 0$,
$N(F, \varepsilon; dl)$ be the smallest number of closed balls in $\text{lsc-fcns}(\mathbb{R}^d)$ with radii $\varepsilon$ that cover $F$.
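As an illustration of the definition, the sketch below builds a cover of a finite set in an arbitrary metric space by a simple greedy rule. The function name `greedy_cover` is hypothetical, and the procedure is a generic device rather than anything used in the paper. Because the centers are restricted to the set itself and the greedy choice need not be optimal, the number of centers returned is only an upper bound on the covering number of that finite set.

```python
def greedy_cover(points, eps, metric):
    """Return centers, chosen among `points`, whose closed eps-balls cover `points`.

    A point becomes a new center only if it is farther than eps from every
    center chosen so far; otherwise it is already covered.
    """
    centers = []
    for p in points:
        if all(metric(p, c) > eps for c in centers):
            centers.append(p)
    return centers
```

For the eleven points 0, 1, ..., 10 on the real line with the absolute-value metric and `eps = 1`, the rule selects the even integers as centers.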


3  Main  Results 


In this section, we establish lower and upper bounds on the covering numbers for bounded subsets of
$\text{lsc-fcns}(\mathbb{R}^d)$. The proofs are postponed to the subsequent section.


3.1  Theorem (covering numbers; upper bound) Suppose that $d \in \mathbb{N}$ and $F \subset \text{lsc-fcns}(\mathbb{R}^d)$ is bounded.
Then, there exist $c > 0$ and $\bar\varepsilon > 0$ (independent of $d$) such that


log  N(F,e;dL)  < 


for all $\varepsilon \in (0, \bar\varepsilon]$.


The constant $c$ depends on the size of a ball that contains the set under consideration, which brings
in the need for boundedness. If the epi-distance had been defined using another norm on $\mathbb{R}^{d+1}$ than the
sup-norm, $c$ would have changed and possibly have depended on $d$. Bounded subsets of $\text{lsc-fcns}(\mathbb{R}^d)$
contain a wide variety of functions. For example, it follows from Proposition 4.1 below that the set
$\{f \in \text{lsc-fcns}(\mathbb{R}^d) : f(0) \le 0\}$ is contained in a ball centered at the zero-function with radius one.
Thus, this set is bounded and can be covered as stipulated in Theorem 3.1. We observe that the
significance of the point $0 \in \mathbb{R}^{d+1}$ derives from its selection as the center of the ball $\rho\mathbb{S}$ in the definition
of $dl_\rho$. However, any other point could have been selected with only trivial implications.
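The radius-one claim can be traced in two lines. The worked step below is a sketch that uses only Proposition 4.1(v) and writes $\delta_f$ for the distance from the origin of $\mathbb{R}^{d+1}$ to the epi-graph of $f$, with $0$ also denoting the zero function.

```latex
f(0) \le 0 \;\Longrightarrow\; (0,0) \in \operatorname{epi} f
\;\Longrightarrow\; \delta_f := \operatorname{dist}(0, \operatorname{epi} f) = 0,
\qquad
\delta_0 := \operatorname{dist}(0, \operatorname{epi} 0) = 0,
\\[4pt]
dl(0, f) \;\le\; \max\{\delta_0, \delta_f\} + 1 \;=\; 1
\quad \text{by Proposition 4.1(v)}.
```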

Although a comparison to the classical result of $O(\varepsilon^{-d})$ for Lipschitz continuous functions on
bounded subsets, which goes back to [17] (see for example [26, Theorem 2.7.1]), is not entirely relevant
due to the different metrics, we note that our bound is only slightly worse (a logarithmic term)
for the larger class of lsc functions. Moreover, we do not require any bound on the variation of the
functions and allow functions defined on all of $\mathbb{R}^d$, possibly extended real-valued.

The proof of Theorem 3.1 leverages recent approximation results for lsc functions. In [24] (see also
[21]), we show that lsc functions can be approximated by piecewise constant functions called epi-splines
that resemble the simple functions of integration theory. The error in approximation, in the epi-distance
metric, can be related directly to the number of pieces in the epi-splines. The challenge then becomes
that of counting the number of balls centered at epi-splines that are needed to cover a particular subset
of $\text{lsc-fcns}(\mathbb{R}^d)$.

We  next  state  a  lower  bound  on  the  covering  numbers. 

3.2  Theorem (covering numbers; lower bound) For every $d \in \mathbb{N}$, there exist a bounded subset $F \subset
\text{lsc-fcns}(\mathbb{R}^d)$ and corresponding $c > 0$ and $\bar\varepsilon > 0$ (independent of $d$) such that

log  N(F,  e;d)  >  j  log  -  for  all  £  £  (0,  e]. 

In comparison with the upper bound of Theorem 3.1, we see that the lower bound differs by a
logarithmic factor only. Let $0$ be the function in $\text{lsc-fcns}(\mathbb{R}^d)$ that is identical to zero everywhere.
We note that the size of the bounded set $F$ in Theorem 3.2 does not have to be large. In fact, an
examination of the proof reveals that $F$ might be selected to have $dl(0, f) \le r$ for all $f \in F$, with $r > 1$
and arbitrarily close to 1.

The proof of Theorem 3.2 constructs a collection of functions that are finite on a grid of points in
$[0, \rho]^d$, with $\rho > 0$ and grid points spaced roughly $\varepsilon$ apart. At each of these grid points, a function takes
on one value among a set of discretized values between $0$ and $\rho$, again spaced roughly $\varepsilon$ apart. Outside
these grid points, the functions are infinity. It is clear that the number of such functions is $(\rho/\varepsilon)^n$,
where $n = (\rho/\varepsilon)^d$. Thus, its logarithm is of the order $O(\varepsilon^{-d} \log \varepsilon^{-1})$. The proof proceeds by showing
that no two of these functions are in a common $\varepsilon$-ball. Thus, it is necessary to have a number of balls
to cover $F$ that is at least the same as the number of functions constructed in this manner.

4  Proofs  and  Supporting  Results 

We start this section with estimates of the epi-distance. An auxiliary quantity is instrumental. For
$\rho \ge 0$ and $f, g \in \text{lsc-fcns}(\mathbb{R}^d)$, let

$$\hat{dl}_\rho(f, g) := \max\{\, e(\operatorname{epi} f \cap \rho\mathbb{S}, \operatorname{epi} g),\; e(\operatorname{epi} g \cap \rho\mathbb{S}, \operatorname{epi} f) \,\},$$

where the excess of a set $C$ over a set $D$ is given by

$$e(C, D) := \sup\{\operatorname{dist}(\bar{x}, D) : \bar{x} \in C\} \quad \text{if } C, D \text{ are nonempty},$$

$e(C, D) = \infty$ if $C$ is nonempty and $D$ is empty, and $e(C, D) = 0$ otherwise. Roughly speaking, $\hat{dl}_\rho(f, g)$ is the
Pompeiu-Hausdorff distance between $\operatorname{epi} f$ and $\operatorname{epi} g$, appropriately intersected with $\rho\mathbb{S}$. The relations
among $dl$, $dl_\rho$, and $\hat{dl}_\rho$ are summarized next. The result is stated for the Euclidean norm on $\mathbb{R}^{d+1}$ in [20,
Exercise 7.60], but remains unchanged in the present context of the sup-norm as established in [21].

4.1  Proposition [21, 20, Exercise 7.60] For $f, g \in \text{lsc-fcns}(\mathbb{R}^d)$ and $\rho \ge 0$, the following holds, where
we use the notation $\delta_f = \operatorname{dist}(0, \operatorname{epi} f)$ and similarly for $g$:

(i)  $\hat{dl}_\rho(f, g) \le dl_\rho(f, g) \le \hat{dl}_{\rho'}(f, g)$ for $\rho' \ge 2\rho + \max\{\delta_f, \delta_g\}$;

(ii)  $dl_\rho(f, g) \le \max\{\delta_f, \delta_g\} + \rho$;

(iii)  $dl(f, g) \ge (1 - e^{-\rho})\,|\delta_f - \delta_g| + e^{-\rho}\, dl_\rho(f, g)$;

(iv)  $dl(f, g) \le (1 - e^{-\rho})\, dl_\rho(f, g) + e^{-\rho}\,[\max\{\delta_f, \delta_g\} + \rho + 1]$;

(v)  $|\delta_f - \delta_g| \le dl(f, g) \le \max\{\delta_f, \delta_g\} + 1$.

The  next  result  is  essentially  a  direct  consequence  of  Proposition  4.1. 

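4.2  Proposition For $f \in \text{lsc-fcns}(\mathbb{R}^d)$ and $r \ge 0$, $dl(0, f) \le r$ implies $\operatorname{dist}(0, \operatorname{epi} f) \le r$.

Proof. Since the origin belongs to $\operatorname{epi} 0$, we have $\operatorname{dist}(0, \operatorname{epi} 0) = 0$. Proposition 4.1(v) then gives that $r \ge dl(0, f) \ge |\operatorname{dist}(0, \operatorname{epi} 0) - \operatorname{dist}(0, \operatorname{epi} f)| = \operatorname{dist}(0, \operatorname{epi} f)$.  □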

We rely on a “discretization” of lsc functions in terms of epi-splines [24, 21] and adopt the notation
$\operatorname{cl} A$ for the closure of a subset $A$ of a topological space. Moreover, for any $f : \mathbb{R}^d \to \overline{\mathbb{R}}$ and $x \in \mathbb{R}^d$, let
$\liminf_{x' \to x} f(x') := \lim_{\delta \downarrow 0} \inf_{x' \in \mathbb{B}(x, \delta)} f(x')$. Epi-splines are defined in terms of a finite collection of subsets
of $\mathbb{R}^d$. A finite collection $R_1, R_2, \dots, R_K$ of open subsets of $\mathbb{R}^d$ is a partition of $\mathbb{R}^d$ if $\bigcup_{k=1}^K \operatorname{cl} R_k = \mathbb{R}^d$
and $R_k \cap R_l = \emptyset$ for all $k \ne l$. Specifically, an epi-spline $s : \mathbb{R}^d \to \mathbb{R}$, with partition $\mathcal{R} = \{R_k\}_{k=1}^K$ of
$\mathbb{R}^d$, is a function that

on each $R_k$, $k = 1, \dots, K$, is a constant real number,
and for every $x \in \mathbb{R}^d$, has $s(x) = \liminf_{x' \to x} s(x')$.
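A one-dimensional instance may help fix ideas. The Python sketch below builds an epi-spline on the partition of the real line into open intervals separated by a sorted list of breakpoints. The helper name `make_epi_spline` is hypothetical, and the sketch assumes the convention that the value at a breakpoint is the smaller of the two neighbouring constants, which is the largest value compatible with lower semicontinuity there.

```python
import bisect


def make_epi_spline(breakpoints, values):
    """1-D epi-spline on (-inf, b_1), (b_1, b_2), ..., (b_{K-1}, inf).

    `breakpoints` must be sorted and `values[k]` is the constant on the k-th
    open interval. At a breakpoint the function takes the smaller of the two
    neighbouring constants (assumed convention), which keeps it lsc.
    """
    if len(values) != len(breakpoints) + 1:
        raise ValueError("need exactly one value per open interval")

    def s(x):
        k = bisect.bisect_left(breakpoints, x)
        if k < len(breakpoints) and breakpoints[k] == x:
            return min(values[k], values[k + 1])  # boundary point
        return values[k]  # interior of the k-th interval

    return s
```

With breakpoints `[0.0, 1.0]` and values `[2.0, -1.0, 3.0]`, the resulting function equals 2 to the left of 0, equals -1 on the closed interval from 0 to 1, and equals 3 to the right of 1.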

The family of all such epi-splines is denoted by $\text{e-spl}(\mathcal{R})$. The ability of epi-splines to approximate lsc
functions is established by the next result; see [24, 21] for further information.

4.3  Proposition For a partition $\mathcal{R} = \{R_k\}_{k=1}^K$ of $\mathbb{R}^d$ and $\rho \ge 0$, we have that for every $f \in
\text{lsc-fcns}(\mathbb{R}^d)$, there exists an $s \in \text{e-spl}(\mathcal{R})$ such that

$$\hat{dl}_\rho(s, f) \le \mu_\rho(\mathcal{R}) := \inf\{\, \eta \ge 0 : R_k \subset \mathbb{B}(x, \eta) \text{ for all } x \in \rho\mathbb{B} \text{ and } k \text{ satisfying } x \in \operatorname{cl} R_k \,\}.$$

If $\mu_\rho(\mathcal{R}) < \rho$, then $s$ can be taken to satisfy $-\rho' \le s(x) \le \max\{-\rho', \min\{\rho', f(x)\}\}$ for any $\rho' > \rho$ and
$x \in \mathbb{R}^d$.

Proof. The first part of the proposition is a direct application of [21, Theorem 5.9]. The fact that $s$ can
be taken to satisfy $-\rho' \le s(x) \le \max\{-\rho', \min\{\rho', f(x)\}\}$ for any $\rho' > \rho$ follows from an examination
of that theorem’s proof.  □

4.4  Proposition [24, Theorem 3.17] If $s, s' \in \text{e-spl}(\{R_k\}_{k=1}^K)$, then

$$dl(s, s') \le \max_{k = 1, \dots, K}\; \sup_{x \in R_k} |s(x) - s'(x)|.$$

We are then ready to give proofs of the main results.


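Proof of Theorem 3.1. Since $F$ is bounded, there exists an $r > 0$ such that $f \in F$ implies that
$dl(0, f) \le r$. Let $\gamma_1, \gamma_2, \gamma_3 > 0$ be such that $\gamma_1 + \gamma_2 + \gamma_3 = 1$. Set $\bar\varepsilon \in (0, 1)$ such that

$$\frac{2(r+1)}{r}\left[\log\frac{1}{\varepsilon} + \log\frac{1}{\gamma_1} + \frac{r}{2} + \log(r+1)\right] - 1 > \gamma_2\varepsilon \quad \text{for all } \varepsilon \in (0, \bar\varepsilon].$$

Fix $\varepsilon \in (0, \bar\varepsilon]$ and define $\rho$ to be the expression on the left-hand side of the previous inequality. We next
construct a partition of $\mathbb{R}^d$ and set $\omega > 1$ and

$$n = \left\lceil \frac{2\omega\rho}{\gamma_2\varepsilon} \right\rceil,$$

where $\lceil a \rceil$ is the smallest integer no smaller than $a$. The partition is obtained by dividing the ball
$\mathbb{B}(0, \omega\rho) = [-\omega\rho, \omega\rho]^d$ into $n^d$ balls of equal size. Specifically, let $K = n^d + 1$ and $R_k$, $k = 1, 2, \dots, n^d$, be
the collection of nonoverlapping open boxes of the form $\prod_{i=1}^d (l_i^k, u_i^k)$, with $l_i^k, u_i^k \in \mathbb{R}$, $u_i^k - l_i^k = 2\omega\rho/n$,
$l_i^k = 2(j-1)\omega\rho/n - \omega\rho$ for some $j \in \{1, \dots, n\}$, and $\bigcup_{k=1}^{n^d} \operatorname{cl} R_k = [-\omega\rho, \omega\rho]^d$. Also, $R_K = \mathbb{R}^d \setminus [-\omega\rho, \omega\rho]^d$. We
denote by $\mathcal{R} = \{R_k\}_{k=1}^K$ this partition. Clearly, $\mu_\rho(\mathcal{R}) = 2\omega\rho/n$. Next, we consider a discretization of
parts of the range of lsc functions and set

$$m = \left\lceil \frac{\omega\rho}{\gamma_3\varepsilon} \right\rceil + 1.$$

The points $a_j = -\omega\rho + 2(j-1)\omega\rho/(m-1)$, $j = 1, 2, \dots, m$, discretize the interval $[-\omega\rho, \omega\rho]$. The
epi-splines in $\text{e-spl}(\mathcal{R})$ that take on one of these $m$ values on each $R_k$ form a collection of $m^K$ unique
epi-splines. Let $S \subset \text{e-spl}(\mathcal{R})$ be this collection of $m^K$ epi-splines. That is, $s \in S$ if for every $k \in \{1, \dots, K\}$,
there exists a $j_k \in \{1, \dots, m\}$ such that $s(x) = a_{j_k}$ for $x \in R_k$. We now show that

$$\bigcup_{s \in S} \mathbb{B}(s, \varepsilon) \supset F.$$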


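Let $f \in F$ be arbitrary. By Proposition 4.3 and the fact that $\mu_\rho(\mathcal{R}) = 2\omega\rho/n \le \gamma_2\varepsilon < \rho$, there
exists $s_0 \in \text{e-spl}(\mathcal{R})$ such that

$$\hat{dl}_\rho(f, s_0) \le \mu_\rho(\mathcal{R}) \quad \text{and} \quad -\omega\rho \le s_0(x) \le \max\{-\omega\rho, \min\{\omega\rho, f(x)\}\} \text{ for } x \in \mathbb{R}^d.$$

Proposition 4.2 ensures that $\operatorname{dist}(0, \operatorname{epi} f) \le r$. Thus, there exists an $x$ such that $\|x\|_\infty \le r$ and $f(x) \le r$.
Consequently, $s_0(x) \le \max\{-\omega\rho, \min\{\omega\rho, f(x)\}\} \le r$. So we also have that $\operatorname{dist}(0, \operatorname{epi} s_0) \le r$.

Since $\varepsilon, \gamma_1 < 1$,

$$\rho > \frac{2(r+1)}{r}\left[\frac{r}{2} + \log(r+1)\right] - 1 = r + \frac{2(r+1)}{r}\log(r+1) > r.$$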

Thus, using the notation $\bar\rho = (\rho - r)/2$, Proposition 4.1 gives that

$$\begin{aligned}
dl(f, s_0) &\le (1 - e^{-\bar\rho})\, dl_{\bar\rho}(f, s_0) + e^{-\bar\rho}(r + \bar\rho + 1) \\
&\le \hat{dl}_\rho(f, s_0) + e^{-\bar\rho}(r + \bar\rho + 1) \\
&\le \mu_\rho(\mathcal{R}) + e^{-\bar\rho}(r + \bar\rho + 1) \\
&= 2\omega\rho/n + e^{-\bar\rho}(r + \bar\rho + 1).
\end{aligned}$$

In view of Proposition 4.4, there exists $s \in S$ such that $dl(s, s_0) \le \omega\rho/(m-1)$ since we can select $s$
such that $|s(x) - s_0(x)| \le \omega\rho/(m-1)$ for all $x \in \mathbb{R}^d$. The triangle inequality then gives that

$$dl(f, s) \le \omega\rho/(m-1) + 2\omega\rho/n + e^{-\bar\rho}(r + \bar\rho + 1).$$


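It remains to show that the right-hand side is at most $\varepsilon$. We start with the last term. By concavity
of the log-function, we have that

$$\log\left[\tfrac{1}{2}(\rho + r) + 1\right] \le \log(r+1) + \frac{\rho - r}{2r + 2}.$$

Consequently,

$$\begin{aligned}
\log\left[e^{-\bar\rho}(r + \bar\rho + 1)\right] &= \tfrac{1}{2}(r - \rho) + \log\left[\tfrac{1}{2}(\rho + r) + 1\right] \\
&\le \tfrac{1}{2}(r - \rho) + \log(r+1) + \frac{\rho - r}{2r + 2} \\
&= \frac{r}{2} - \frac{r(\rho + 1)}{2(r+1)} + \log(r+1) \\
&= \log \gamma_1\varepsilon,
\end{aligned}$$

where the last equality follows from inserting the expression for $\rho$. Thus, $e^{-\bar\rho}(r + \bar\rho + 1) \le \gamma_1\varepsilon$. We then
examine the second term. Inserting the expression for $n$, we obtain that

$$\frac{2\omega\rho}{n} \le \gamma_2\varepsilon.$$

Finally, we consider the first term. In view of the definition of $m$, we have that

$$\frac{\omega\rho}{m - 1} \le \gamma_3\varepsilon.$$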
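Thus, $dl(f, s) \le \varepsilon$. Since $f$ is arbitrary, we have established that $\bigcup_{s \in S} \mathbb{B}(s, \varepsilon)$ covers $F$. The logarithm
of the number of functions in $S$ is $(n^d + 1)\log m$. At this point, the order of the result is immediate. A
possible expression for the constant $c$ is obtained as follows. Let $c_1 = 2(r+1)/r$ and

$$c_2 = \frac{2(r+1)}{r}\left[\log\frac{1}{\gamma_1} + \frac{r}{2} + \log(r+1)\right] - 1.$$

Thus, $\rho = c_1 \log \varepsilon^{-1} + c_2$. Moreover, let $c_3 = 2\omega/\gamma_2$ and $c_4 = \omega/\gamma_3$. Using these expressions, we find
that

$$(n^d + 1)\log m \le \left[\left(c_1 c_3 + \frac{c_2 c_3 + 1}{\log \varepsilon^{-1}}\right)^d \left(\frac{1}{\varepsilon}\log\frac{1}{\varepsilon}\right)^d + 1\right] \log\left[\left(c_1 c_4 + \frac{c_2 c_4 + 2}{\log \varepsilon^{-1}}\right)\frac{1}{\varepsilon}\log\frac{1}{\varepsilon}\right].$$

Let

$$c_5 = c_1 c_3 + \frac{c_2 c_3 + 1}{\log \bar\varepsilon^{-1}} \quad \text{and} \quad c_6 = c_1 c_4 + \frac{c_2 c_4 + 2}{\log \bar\varepsilon^{-1}}.$$

We then find that

$$(n^d + 1)\log m \le \left(c_7\, \frac{1}{\varepsilon}\log\frac{1}{\varepsilon}\right)^d \left[\log c_6 + \log\frac{1}{\varepsilon} + \log\log\frac{1}{\varepsilon}\right], \quad \text{where } c_7 = c_5 + \frac{1}{\bar\varepsilon^{-1}\log \bar\varepsilon^{-1}}.$$

Using the fact that $\log\log \varepsilon^{-1} / \log \varepsilon^{-1} \le e^{-1}$ for $\varepsilon \in (0, 1)$, we obtain that

$$(n^d + 1)\log m \le c_7^d \left[\frac{\log c_6}{\log \bar\varepsilon^{-1}} + 1 + e^{-1}\right] \left(\frac{1}{\varepsilon}\right)^d \left(\log\frac{1}{\varepsilon}\right)^{d+1},$$

which gives a particular expression for $c$ in the theorem statement. Since the choice of $\bar\varepsilon$ is independent
of $d$, this $c$ is independent of $d$.  □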
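The quantities in this proof are easy to evaluate numerically. The Python sketch below computes $\rho$, the grid sizes and the three error terms for given $\varepsilon$, $r$ and $d$. It assumes that $n$ is the ceiling of $2\omega\rho/(\gamma_2\varepsilon)$ and that $m$ is the ceiling of $\omega\rho/(\gamma_3\varepsilon)$ plus one, which is consistent with the constants $c_3$ and $c_4$ above but is a reconstruction rather than a quotation. The function name `cover_parameters`, the default weights and the default $\omega = 2$ are illustrative choices.

```python
import math


def cover_parameters(eps, r, d, gammas=(1 / 3, 1 / 3, 1 / 3), omega=2.0):
    """Evaluate rho, n, m and the three error terms from the proof of Theorem 3.1.

    Assumed (reconstructed) choices: n = ceil(2*omega*rho/(gamma2*eps)) and
    m = ceil(omega*rho/(gamma3*eps)) + 1.
    """
    g1, g2, g3 = gammas
    c1 = 2.0 * (r + 1.0) / r
    rho = c1 * (math.log(1 / eps) + math.log(1 / g1) + r / 2 + math.log(r + 1)) - 1.0
    n = math.ceil(2 * omega * rho / (g2 * eps))   # boxes per coordinate
    m = math.ceil(omega * rho / (g3 * eps)) + 1   # number of range levels
    rho_bar = (rho - r) / 2
    terms = (
        math.exp(-rho_bar) * (r + rho_bar + 1.0),  # tail term, bounded by gamma1*eps
        2 * omega * rho / n,                       # partition term, bounded by gamma2*eps
        omega * rho / (m - 1),                     # range term, bounded by gamma3*eps
    )
    log_count = (n ** d + 1) * math.log(m)         # log of the number of epi-splines in S
    return {"rho": rho, "n": n, "m": m, "terms": terms, "log_count": log_count}
```

Calling `cover_parameters(0.1, 1.0, 2)` returns a dictionary whose `terms` entry holds the three contributions to the distance bound, so one can check directly that they sum to at most `eps`.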

Proof of Theorem 3.2. Let $\rho > 0$ and $F = \{f \in \text{lsc-fcns}(\mathbb{R}^d) : f(x) \le \rho \text{ for at least one } x \in
[0, \rho]^d\}$. We show that $F$ cannot be covered with a lower number of balls than stipulated. Clearly,
$\operatorname{dist}(0, \operatorname{epi} f) \le \rho$ for all $f \in F$. Thus, in view of Proposition 4.1(v), $dl(0, f) \le \rho + 1$ for all $f \in F$ and
$F$ is therefore bounded.

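Next, let $\varepsilon \in (0, \rho e^{-\rho}/6]$. We discretize $[0, \rho]^d$ by defining $x_i^k = k\rho/n_\varepsilon$, $k = 1, \dots, n_\varepsilon - 1$ and
$i = 1, \dots, d$, where

$$n_\varepsilon = \left\lfloor \frac{\rho e^{-\rho}}{3\varepsilon} \right\rfloor \ge 2,$$

with $\lfloor a \rfloor$ being the largest integer not exceeding $a$. The discretization of $[0, \rho]^d$ then contains the points
$(x_1^{k_1}, x_2^{k_2}, \dots, x_d^{k_d})$, with $k_i \in \{1, 2, \dots, n_\varepsilon - 1\}$ and $i = 1, \dots, d$. Clearly, the distance between any two such
points in the sup-norm is at least $\rho/n_\varepsilon \ge 3\varepsilon e^{\rho}$. We carry out a similar discretization of $[0, \rho]$ and define
$y^l = l\rho/n_\varepsilon$, $l = 1, \dots, n_\varepsilon$. The functions that are finite on the discretization points of $[0, \rho]^d$, with values
at each such point equal to $y^l$ for some $l$, and have value infinity elsewhere are given by $S_\varepsilon$, i.e.,

$$S_\varepsilon = \{\, f \in \text{lsc-fcns}(\mathbb{R}^d) : \text{for each } x = (x_1^{k_1}, \dots, x_d^{k_d}), \text{ with } k_i \in \{1, 2, \dots, n_\varepsilon - 1\},\ f(x) = y^l \text{ for some } l = 1, \dots, n_\varepsilon;\ f(x) = \infty \text{ otherwise} \,\}.$$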


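Certainly, $S_\varepsilon \subset F$. We next define

$$G_\varepsilon(f) = \{g \in \text{lsc-fcns}(\mathbb{R}^d) : \hat{dl}_\rho(f, g) < \varepsilon e^{\rho}\}, \quad f \in \text{lsc-fcns}(\mathbb{R}^d).$$

We establish that $G_\varepsilon(f) \cap G_\varepsilon(f') = \emptyset$ for $f, f' \in S_\varepsilon$, $f \ne f'$. Suppose for the sake of a contradiction
that there is a $g$ with $g \in G_\varepsilon(f)$ and $g \in G_\varepsilon(f')$ for $f, f' \in S_\varepsilon$, $f \ne f'$. Then, $\hat{dl}_\rho(f, g) < \varepsilon e^{\rho}$ and
$\hat{dl}_\rho(f', g) < \varepsilon e^{\rho}$. However, since $f \ne f'$, there exists a point $x \in [0, \rho]^d$ with $|f(x) - f'(x)| \ge 3\varepsilon e^{\rho}$.
Without loss of generality, suppose that $f(x) \le f'(x) - 3\varepsilon e^{\rho}$. Since $f(z), f'(z) = \infty$ for all $z \ne x$ with
$\|z - x\|_\infty < 3\varepsilon e^{\rho}$, we have that $\hat{dl}_\rho(f, g) < \varepsilon e^{\rho}$ implies that $g(z) < f(x) + \varepsilon e^{\rho}$ for some $z \in \mathbb{B}(x, \varepsilon e^{\rho})$.
Moreover, $\hat{dl}_\rho(f', g) < \varepsilon e^{\rho}$ implies that $g(z) > f'(x) - \varepsilon e^{\rho} \ge f(x) + 3\varepsilon e^{\rho} - \varepsilon e^{\rho} = f(x) + 2\varepsilon e^{\rho}$ for all
$z \in \mathbb{B}(x, \varepsilon e^{\rho})$. Since this is not possible for $g$, we have reached a contradiction. Thus, $G_\varepsilon(f) \cap G_\varepsilon(f') = \emptyset$
for $f, f' \in S_\varepsilon$, $f \ne f'$.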

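By Proposition 4.1(i,iii), for any $f \in \text{lsc-fcns}(\mathbb{R}^d)$,

$$dl(f, g) \ge e^{-\rho}\, dl_\rho(f, g) \ge e^{-\rho}\, \hat{dl}_\rho(f, g) \ge e^{-\rho}\varepsilon e^{\rho} = \varepsilon \quad \text{for all } g \notin G_\varepsilon(f).$$

Hence, for $f \in S_\varepsilon$, an $\varepsilon$-ball that contains $f$ needs to be centered at some $g \in G_\varepsilon(f)$. Since the sets
$G_\varepsilon(f)$, $f \in S_\varepsilon$, are nonoverlapping, a cover of $S_\varepsilon$ by $\varepsilon$-balls must involve a number of balls that is at
least as great as the number of functions in $S_\varepsilon$, which is $n_\varepsilon^{m_\varepsilon}$, where $m_\varepsilon = (n_\varepsilon - 1)^d$. Thus,

$$\log N(F, \varepsilon; dl) \ge m_\varepsilon \log n_\varepsilon \ge \left(\frac{\rho e^{-\rho}}{3\varepsilon} - 2\right)^d \log\left(\frac{\rho e^{-\rho}}{3\varepsilon} - 1\right).$$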


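Let $c_1 = |\log(\rho e^{-\rho}/4)|$ and $\bar\varepsilon = \min\{\rho e^{-\rho}/12, e^{-2c_1}\}$. Continuing from the previous inequality, we then
find that

$$\log N(F, \varepsilon; dl) \ge \left(\frac{\rho e^{-\rho}}{6}\right)^d \left[1 + \frac{\log(\rho e^{-\rho}/4)}{\log \varepsilon^{-1}}\right] \frac{1}{\varepsilon^d}\log\frac{1}{\varepsilon}.$$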


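Since $\log \varepsilon^{-1} \ge 2|\log(\rho e^{-\rho}/4)|$ for $\varepsilon \in (0, \bar\varepsilon]$, we have that

$$\log N(F, \varepsilon; dl) \ge \left(\frac{\rho e^{-\rho}}{6}\right)^d \frac{1}{2\varepsilon^d}\log\frac{1}{\varepsilon} \quad \text{for } \varepsilon \in (0, \bar\varepsilon],$$

and the conclusion is reached.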
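The counting argument can be checked numerically for specific values. The Python sketch below evaluates the number of grid levels, the number of grid points and the logarithm of the number of functions in the separated family, and compares it with the closed-form lower bound. The closed form with the factor $(\rho e^{-\rho}/6)^d/(2\varepsilon^d)$ is the one derived from the preceding steps as reconstructed here, and the function names are hypothetical.

```python
import math


def separated_family_size(eps, rho, d):
    """Return (n_eps, m_eps, log of the number of functions in the separated family)."""
    q = rho * math.exp(-rho)
    if eps > q / 6:
        raise ValueError("the construction requires eps <= rho*exp(-rho)/6")
    n_eps = math.floor(q / (3 * eps))     # number of admissible function values
    m_eps = (n_eps - 1) ** d              # number of grid points in [0, rho]^d
    return n_eps, m_eps, m_eps * math.log(n_eps)


def closed_form_lower_bound(eps, rho, d):
    """Closed-form bound (reconstructed): (q/6)^d / (2 eps^d) * log(1/eps)."""
    q = rho * math.exp(-rho)
    return (q / 6) ** d / (2 * eps ** d) * math.log(1 / eps)


def eps_bar(rho):
    """Threshold below which the closed-form bound is claimed to hold."""
    q = rho * math.exp(-rho)
    c1 = abs(math.log(q / 4))
    return min(q / 12, math.exp(-2 * c1))
```

For example, with `rho = 1.0`, `d = 2` and `eps = 0.005`, which lies below `eps_bar(1.0)`, the family has 24 admissible values at each of 529 grid points, and its log-cardinality exceeds the closed-form bound.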